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Highly potent inhibitors of human cathepsin L identified by screening 
combinatorial pentapeptide amide collections 

Achim Brinker'- 2 Ekkehard Weber 3 . Dieter Stoll 24 , Jurgen Voigt 1 , Annett Muller 5 1, Norbert Sewald 5 . 
Gunther Jung 4 , Karl Heinz Wiesmuller 6 and Peter Bohley 1 

>Pfc *;*!n*isch chemisches Institui and 7 Natunissenschafiliches und Medizinisches an der Universitat Tubingen, Germany; 
^^^tiSSlSJch^U der Universitdt Holle-Wincnberz, Germany; <lnstitutfur Or S aniscHe CHemie, Institut der Universe 
Tubingen. Germany;' University Bielefeld, Fakuhal fur Chemie, Germany; *EMC mic recollections GmbH, Tubingen, Germany 

By screening a combinatorial pentapeptide amide collection in an inhibition assay, we systematically evaluated 
the potential of 19 proteinogenic amino acids and seven nonproteinogenic amino acids to serve as building blocks 
for inhibitors of human cathepsin L. Particularly efficient were aromatic, bulky, hydrophobic ammo- acid 
residues especially leucine, and positively charged residues, especially arginine. Building blocks for potential 
inhibitor peptides were combined by random selection from their activity pattern. This random approach for the 
design of inhibitors was introduced to compensate for the inaccuracy induced by shifted docking of combinatorial 
compound collections at the active center of cathepsin L. Thereby, we obtained structurally defined pentapeptide 
amides which inhibited human cathepsin L at nanomolar concentrations. Among the most P"*^ 
one peptide RKLLW-NH 2 , shares the amphophilic character of the nonamer fragment VMNGLQNRK oi the 
autoinhibitory, substrate-like, but reverse-binding prosegment of human cathepsin L which blocks the active 
center of the enzyme. Obviously, RKLLW-NH 2 carries the functions that are important for enzyme-peptide 
interaction in a condensed form. This hypothesis was confirmed by structure- activity studies using truncated and 
modified pentapeptides. 

Keywords: cathepsin L inhibitors; synthetic peptide library. 



Cathepsin L (EC 3.4.22.16) is one of the major papain-like 
cysteine proteases expressed in animal tissues [1,2]. It is 
ubiquitously distributed in the lysosomes of all cells where 
it has a prominent role in nonspecific intracellular protein 
breakdown [2]. Cathepsin L is synthesized as an inactive 
proenzyme preventing premature proteolytic activity. An 
N-terminal 96-amino-acid protein extension, the prosegment, 
serves as a potent and specific macromolecular inhibitor of 
immature procathepsin L [3,4]. High levels of the proenzyme 
are secreted by nontransformed cells, such as osteoclasts, 
fibroblasts and macrophages, as well as by malignantly 
transformed cells [5]. Conversion to the mature enzyme occurs 
intracellular^ in lysosomes at pH 3.0-3.5 by autocatalyuc 
removal of the prosegment [5,6], whereas extracellularly at 
pH 5 5-6 0 maturation is supported by negatively charged 
matrix surfaces [5-7] or metalloprotcases [5]. Because of its 
high activity against a wide variety of protein substrates [5], 
mature cathepsin L has physiologically important extracellular 
functions [8-11]. Human cathepsin L is of particular interest 
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because of its suggested involvement in the pathogenesis of 
rheumatoid arthritis [12], as well as in tumor invasion and 
metastasis [13]. Inhibition of the enzyme is expected to be a 
promising anticancer strategy [13-16]. Therefore, human 
cathepsin L is an interesting target for drug design and potent 
low molecular mass inhibitors are expected to be good 
candidates for therapeutic agents. 

Cry stal structures for human procathepsin L [17] and the 
mature form of cathepsin L [18] have been published. The 
enzyme shares the common fold of the papain superfamily. Its 
two domains are arranged l V'-like and enclose the central 
substrate-binding cleft. The hydrophobic S 2 -binding site forms 
the only distinct pocket [19]. Accordingly, the substrate 
specificity of cathepsin L seems to be largely determined by 
bulky hydrophobic amino acids in the P2 position of substrate 
proteins [20,21]. The inhibitory mechanism of the prosegment 
is common to all members of the papain family [4]. By 
interacting with the substrate binding cleft in the reverse 
orientation compared with natural protein substrates, the 
otherwise substrate-like prosegment blocks the fully established 
reactive center of the protease without being cleaved. 

Diverse structure-based strategies using large fragments and 
partial sequences of macromolecular inhibitors of cysteine 
proteases [3,22-24], as well as strategies involving the testing 
of a limited range of individual substrate homologs [5] have 
been followed to identify potent inhibitors of members of the 
cathepsin family of proteases. To date, however, a compre- 
hensive, nonstructure based, and therefore unbiased approach 
has yet to be reported. Combinatorial chemistry offers non- 
structure-based strategies for the study of molecular interac- 
tions [25-28]. Thus, the use of combinatorial compound 
collections is a promising approach to understand the structural 
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requirements for potent inhibitors of human cathepsin L. In the 
search for novel inhibitory peptides of human cathepsin L, we 
first determined the sequence-dependent features of proteino- 
genic and seven nonproteinogenic amino acids by testing a 
series of 130 combinatorial peptide amide collections. With one 
defined position in each collection each individua^O/X^NHs 
mixture contained a calculated number of 19 different 
peptides. In order to obtain peptide inhibitors, we combined 
amino acids at all sequence positions that were found to be 
conducive to inhibition in the respective sequence positions. 



EXPERIMENTAL PROCEDURES 

Synthesis and analysis of defined peptides and pentapeptide 
amide collections 

Synthetic pentapeptide amide collections, as well as individual 
pentapeptide amides, were prepared by fully automated solid- 
phase peptide synthesis using Fmoc/tBu chemistry and Rink 
amide 4-methylbenzhydrylamine polystyrene resins as described 
recently [29). In brief, in the case of peptide amide collections 
the introduction of randomized sequence positions (X) was 
performed in a double coupling step with premixed equimolar 
mixtures of Fmoc-L- amino acids that were used in an equimolar 
ratio with respect to the coupling sites of the resins. For 
coupling of defined sequence positions (O), a fivefold molar 
excess of single Fmoc-L-amino acids was used. An optimized 
diisopropylcarbodiimide/l-hydroxybenzotriazole method was 
used to synthesize the collections to yield close to equimolar 
mixtures [30]. Defined peptides were prepared by multiple 
peptide syntheses [3!]. W-Acetylation was performed by 
reacting the N-terminally deprotected resin-bound peptide with 
a 1 : 1 mixture of diisopropylethylamine [1.5 m in dichloro- 
methane/dimethylformamide (1:1)) and acetic anhydride 
[1.5 M in dichloromethane/dimethylformamide (1:1)] for 
30 min. To synthesize pentapeptides (C-terminal carboxy 
groups), 2-chlorotritylchloride/l %-divinylbenzene-polystyrene 
resins (Novabiochem, Bad Soden, Germany) were used. The 
identity of defined peptides was confirmed by electrospray 
ionization mass spectrometry (ES1-MS) and the purity was 
> 80% as determined by HPLC. The amino-acid composition 
in the defined position and the random sequence positions of 
0/X 4 -NH 2 peptide mixtures was determined by pool sequenc- 
ing [32] and ESI-MS [33]. Deviations from equimolar 
representation of the amino acids in randomized sequence 
positions were found to be within the error limits of the 
analytical methods. Fmoc- protected amino acids (3-leucine, 
homo-3-leucine and homo-p-isoleucine were prepared by a 
modified Amdt-Eistert synthesis [34]. 



Human cathepsin L 

Human procathepsin L was purified from the culture medium 
of the human non-small-cell lung cancer cell line EPLC 32M1 
as described elsewhere [35] and converted to mature and fully 
active single chain cathepsin L on a negatively charged surface. 
Procathepsin L was applied to a carboxymethyl-Sephadex C 
50 column equilibrated with 0.01 m sodium acetate (pH 5.5). 
Conversion to the mature enzyme occurred during elution with 
a linear gradient of NaCl (0-0.5 m). Cathepsin L-containing 
fractions were concentrated and 0.5 u-L 0.05 m HgCl 2 and 
0 5 u.L 0.1 M EDTA were added to_ 100 pJL aliquots of the 
enzyme stock solution (118 ng-mL" 1 ). These aliquots were 
stored at -70 °C until use. 



Cathepsin L inhibition assay 

The stock solution of human cathepsin L was diluted 10 000- fold 
with reactivation buffer (0.1 m NaAc, pH 5.5; 0.01% Triton-X 
100; 5 mM EDTA; 5 mM dithiothreitol) and incubated at 0 °C 
for 30 min. The inhibition assay was performed in 96-well 
assay plates. Aliquots of the activated enzyme solution (50 \xL) 
were pre-incubated in the absence or presence of putative 
peptide inhibitors (50 yiM) for 30 min at 0 °C. The reaction 
was started by the addition of substrate (10 p.L of Z-Phe-Arg-7- 
amino-4-methylcoumarine, 10 u.m in reactivation buffer) [36]. 
After 20 min at 37 °C, the reaction was terminated by the 
addition of 100 \lL stop solution (1 M NaAc, 200 mM sodium- 
monochloroacetate, pH 3.0) and fluorescence intensities 
were measured using a luminescence spectrometer (type 
LS50B, PerkinElmer, Uberlingen). All tests were performed 
in triplicate. The pentapeptide mixtures were solubilized in 
aqueous dimethylsulfoxide (1% v/v), defined peptides in 
aqueous dimethylsulfoxide (5% v/v). Extremely hydrophobic 
individual peptides were solubilized in aqueous dimethyl- 
sulfoxide (5% v/v) by the addition of Triton-X 100 (0.01 % v/-^. 
Enzyme activities were measured in the presence of 
X 5 -NH 2 collection or of O/X4-NH2 collections and were 
expressed as relative enzyme activities in percent values of 
the maximal activity achieved in the absence of peptides 
(100%). 

RESULTS 

Screening of a combinatorial peptide amide library 

To identify promising amino-acid building blocks of human 
cathepsin L inhibitors, a combinatorial pentapeptide amide 
collection was investigated in an inhibition assay against the 
enzyme. Collections represented by the formulas OXXXX- 
NH 2 , XOXXX-NH 2 , XXOXX-NH 2 , XXXOX-NH 2 and 
XXXXO-NH2 were prepared. They differed in the position of 
the defined amino acid represented by O. Randomized positions 
X reflected a mixture of 19 (cysteine excluded) of the 20 
genetically encoded amino acids (X = A,D,E,F,G,H,I,K,L,M, 
N,P,Q,R,S,T,V,W,Y). Defined positions (O) were represented by 
one of these 19 proteinogenic amino-acid residues or by one ^ f 
the seven nonproteinogenic amino acids citrullin, hydroxyp. 
line, tetrahydroisochinoline-3-carbonic acid, a-pyridylalanine, 
a-naphthylalanine, /?-nitrophenyl alanine or homophenyl alanine 
(B). Thus, each of the five sets of 0/X4-NH 2 collections 
contained 26 different subsets, each representing 19 4 individual 
peptides, for a grand total of 26 x 19 4 = 3.4 x 10 6 individual 
peptides. 

In addition, the completely randomized collection X 5 -NH 2 
was prepared and tested as a standard. X 5 -NH 2 represented the 
average inhibitory effect of all peptides present in the 
combinatorial pentapeptide amide collection made from 19 
proteinogenic amino acids. Individual 0/X4-NH 2 partial 
collections represented the inhibitory effect of the subset of 
pentapeptide amides that contained a distinct defined amino 
acid at a particular sequence position. Amino acids O useful as 
building blocks of inhibitors are expected to increase the 
inhibitory effect of 0/X4-NH 2 collections compared wilh 
X 5 -NH 2 and to cause a stronger reduction of enzyme activity. 

In the inhibition assay, all peptide amide collections were 
tested close to the IC 50 concentration of X 5 -NH 2 . At a final 
concentration of 136 jxm, X 5 -NH 2 reduced the enzyme activity 
to 48%. Therefore, 48% enzyme activity was defined as the 
reference value and was subtracted from all enzyme activities 
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achieved in the presence of O/X4-NH2 collections. The results 
from testing 130 O/X4-NH2 collections are given in Fig. 1. 
Amino acids promoting inhibition showed negative relative 
enzyme activities. Defined amino acids (O) which were 
regarded as favorable for inhibition showed relative enzyme 
activities of < -10%. vu 
Significant differences in the inhibitory' activities of 0/X 4 -NH 2 
collections were found to be dependent both on the defined 
amino-acid residue and their positions within the sequence. 
Therefore, it was possible to estimate the potential of 19 
genetically encoded and seven nongcnetically encoded amino 
acids to serve as building blocks for inhibitors of human 
cathcpsin L. The bulky aliphatic amino acids leucine and 
methionine, the aromatic amino acids phenylalanine, tyrosine 
and tryptophan, as vwcll as the positively charged amino acids 
arginine and lysine considerably improved the inhibitory effects 
when representing O within 0/X 4 -NH 2 collections. In contrast, 
the negatively charged amino acids aspartate and glutamate, the 
small aliphatic amino acids alanine and glycine, and the ammo 
acids proline, asparagine and glutamine were nonfavorable 
for inhibition when representing O in 0/X 4 -NH 2 collections. 
Interestingly, leucine and methionine promoted, whereas iso- 
leucine hindered, the inhibitory activity of 0/X4-NH 2 collec- 
tions (Fig. 1). The introduction of the phenylalanine analogs 
p-nitrophenylalanine or homophenylalanine in 0/X 4 -NH 2 
collections had no influence. The introduction of other 
nonproteinogenic amino acids such as citrullin, hydroxyproline, 
tetrahydroisochinoline-3-carbonic acid, a-pyridylalanine and 
a-naphtylalanine caused a considerable decrease in inhibitory 
activities of O/X, NH 2 collections. 

The effects described so far were not strongly influenced by 
the positions of the defined amino acids in the pentapepiide 
amides. However, in position 3, the favorable effect of leucine 
was more pronounced than in all other positions. Phenylalanine 
was the second most favorable amino acid in this position, but 
was considerably less effective than leucine. Basic amino 
acids showed no pronounced effect at position 3. In position 
5, tryptophan instead of leucine was particularly conducive to 
inhibition. 

Deduction of individual pentapeptide amide inhibitors 

The cut-off value for considering amino acids as building 
blocks of putative cathcpsin L inhibitors was set at -10% 
relative enzyme activity in the presence of the respective 
O/X4-NH2 collections. The mean activity values of nearly all 
O/X4-NH2 collections that reduced the relative enzyme activity 
to at least -10% (Fig. 1) were significantly different from the 



Fig. 1. Onc-hundrcd and thirty O/X4-NH2 collections and the X 5 NH 2 
collection were tested for inhibition of human cathcpsin L at concen- 
terations of 136 >im. The defined positions (O) contained 19 proteino^cnic 
amino acids and seven unusual amino acids. (1) citrullin. (2) hydroxy- 
proline, (3) tetrahydroisochinoline-3-carbonic acid, (4) a-pyridylalanine. 
(5) a-naphthylalanine. (6) p-nitrophenylalanine. and (7) homophenylala- 
nine. X 5 -NH 2 reduced the enzyme activiiy 10 48%. The effects of O/X4- 
NH 2 collections are shown as enzyme activates relative to the baseline 
value of 48%. Defined amino acids (O) promoting inhibition in O/X4 NH 2 
collections relative 10 the average effect achieved by X 5 -NH 2 show negative 
relative enzyme activities. Amino acids that were regarded favorable 
effected relative enzyme activities of < - 10% and are marked by empty 
bars (A-E) Results for all five groups of pentapeptide amide collections 
OXXXX-NH 2 , XOXXX-NH;. XXOXX-NH 2 , XXXOX-NH 2 and XXXXO 
NH 2 with defined amino acids at the five sequence positions. 




ADEFOHIKLMNPQR8TVWY1234567 

amino ackd» »n position 1 




ADEFGH I KL MNPQR5TVWY1 23 45B 7 
amino ac*J« in position 3 
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mean activity values of the X 5 -NH 2 collection at the 5% 
significance level (Student's Mest) with the exception of the 
O/X4-NH2 collection carrying Thr at position 4. Nevertheless, 
for the design of individual peptides, Thr at position 4 was 
considered to increase the diversity of putative cathepsin L 
inhibitors. The amino acids that reduced the relative enzyme 
activity to at least —10% are listed in Table 1. Amino acids 
with properties such as bulky hydrophobic, aromatic and basic 
were found to be active at different positions. 

The results obtained with the peptide collections used in this 
study could be influenced by shifted docking of 0/X 4 -NH 2 
pentapeptide amides. This is indicated by residues repetitively 
identified as favorable for inhibition at different positions 
(Table 1) and complicates the directed selection of amino acids 
from the screening results. To evaluate these assumptions, 
amino-acid residues for putative pentapeptide inhibitors were 
randomly selected from the amino acids listed in Table 1. Of 
numerous possible combinations, an initial set of 76 sequences 
was defined and synthesized (Table 2). Forty-nine peptides 
obeyed the rules for amino-acid residues and positions given in 
Table 1. Eighteen peptides contained tyrosine, arginine or 
lysine instead of phenylalanine or leucine at position 3. 
However, they were not among the best inhibitors, and neither 
were two control peptides with glutamine in position 5 
(Table 2). Three peptides with tryptophan in position 4 were 
prepared (Table 2; 4, 25, 32) and, unexpectedly, peptide 
number 4, RKLWL, proved to be one of the most active 
inhibitors (Table 3). Valine replacing leucine at position 5 
(peptide RKLWV) was nonfavorable for inhibition. Four 
negative controls were synthesized containing negatively 
charged amino acids (Table 2, numbers 65, 71, 72 and 76). 

In summary, peptide amides, such as RKLLW-NH 2 , that 
were composed solely of the favorable amino acids listed in 
Table 1 snowed strong activity, whereas negatively charged 
pentapeptide amides, such as EEEEE-NH 2 , did not influence 
the enzyme activity up to a concentration of 1 80 p-M (Fig. 2). 
Several inhibitors with IC50 values in the submicromolar range 
could be identified. In accordance with the results obtained by 
screening of the O/X4-NH2 collections, substitution of one 
residue within the RKLWL- NH 2 sequence by a negatively 
charged residue caused a considerable loss in the inhibition 
potential (data not shown), e.g. the IC50 value of RKLWD-NH 2 
was 18 times that of RKLWL-NH 2 . 



Table 2. Defined pentadecapeptide amides for evaluation of the 
screening results obtained with 0/X4-NH2 collections. B, homophenyl- 
alanine. 



Peptide 


Sequence 


Activity (%) 


Peptide 


Sequence 


Activ 


1 


RKLLW 


16 


39 


LWLFW 


69 


2 


LLLRW 


17 


40 


LLYTW 


70 


3 


LLLTR 


17 


41 


YFLLR 


70 


» 4 


RKLWL 


18 


42 


RLLYW 


71 


• 5 


RKLFL 


21 


43 


YFLTF 


71 


6 


LKFTR 


22 


44 


LLYTB 


75 


7 


LFLRL 


22 


45 


LRFTF 


75 


8 


LFLTR 


24 


46 


LKFTF 


76 


9 


LKLLW 


25 


47 


• RRFYV 


if. 
1 0 


10 


RLLLW 


25 


48 


LKLFW 


78 


1 1 


LRLLW 


27 
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RFLRW 


70 
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T T T TT 
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t\ VY L, I 




KA 


LKirL 


82 
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LY rvrr* 
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1 8 


I 1 YTR 
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DO 


IKI LW 


84 
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T I T I P 
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"*R 

JO 


3 / 


P LL'UD 

KrrKrJ 
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ITT TW7 
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RRYRR 


87 
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JLivLrr 
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YYT T T> 
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oz 


RFFRR 


93 


7^ 
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ai 

OJ 


1 WLl W 


95 


7ft 


T 7 FRW 


48 


04 


LlvrKr 


o c 
95 


27 


LLFLW 


48 


65 


EDEDE 




28 


LWLLW 


48 


66 


YYYYW 


96 


29 


LLYLW 


53 


67 


RFYRW 


98 


30 


YWFTF 


57 


68 


RFYRL 


98 


31 


LKLFF 


58 


69 


RWFRW 


98 


32 • 


RKLWV 


59 


70 


YWLLR 


101 


33 


YLYLF 


60 


71 


DEDED 


102 


34 


RRLTW 


62 


72 


EEEEE 


103 


35 


LLLLW 


63 


73 


LKFRB 


104 


36 


YLLFW 


63 


74 


LFYRL 


104 


37 


RRYLB 


67 


75 


RKLRQ 


105 


38 


LKFTF 


68 


76 


EQPPK 


112 



Modifications of model peptide RKLLW-NH 2 

From the most efficient inhibitors identified, the peptide amide 
RKLLW-NH 2 was chosen as a model peptide for further 
investigations. It contains the structural properties which 
supported inhibition of human cathepsin L by pentapeptide 

Table 1. Amino acids favorable for the design of cathepsin L inhibitors. 
Amino acids O in 0/X 4 -NH 2 collections that reduced the relative enzyme 
activity of at least 10% are listed in their respective sequence positions. 



Position 1 2 3 4 5 



LLLLW 
R F F , R R 

Y R F L 

K Y V 

W T F 

Y 



amides. The K- x of RKLLW-NH 2 was determined to be 130 nM, 
Whereas RKLLW-NH 2 reduced the enzyme activity to 34% at a 
concentration of 0.9 jxm (Fig. 2), the inverted peptide WLLKR- 
NH 2 showed no inhibitory activity (data not shown). To identify 
a functional minimal core structure of RKLLW- NH 2 , homo- 
logous dipeptide, tripeptide, and tetrapeptide amides were tested 

Table 3. IC 50 value of four newly defined potent pentapeptide amide 
inhibitors of cathepsin L Replacement of C-terminal leucine by aspartxc 
acid reduces activity by a factor of 18. 



Sequence IC 50 (*jl.m) 



LLLTR- NH 2 0.5 

RKLLW- NH 2 0.6 

LFLTR -NH 2 0.8 

RKLWL-NH 2 0.8 

RKLWD-NHj 14.0 
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Fig- 2. Titration of the two individual pentapcptide amides RKLLW- 
NH 2 and EEHEE-NH 2 in a cathepsin L inhibition assay. RKLLW NH 2 
consists of favorable amino acids only and shows an lCso of 0.6 u.M. 
whereas the highly negatively charged peptide EEEEE-NH 2 contains only 
the nonfavorable glutamate and has no effect on enzyme activity up to a 
concentration of 180 p-M. 

for their inhibitory efficiency. However, none of the truncations 
of RKLLW-NH 2 showed comparable inhibition at a concentra- 
tion of 0.9 ijlm (data not shown). Investigations of terminally 
modified RKLLW-NH 2 , such as Ac- RKLLW- NH2 and 
RKLLW-COOH, indicated that the positively charged N-terminus 
promotes the enzyme-peptide interactions, whereas a nega- 
tively charged C-tcrminus is nonfavorable (Fig. 3A). 

Successive substitutions of all five amino acids in RKLLW- 
NH 2 for alanine (alanine scan) were carried out to evaluate the 
contribution of the distinct amino-acid residues to the activity 
of the pentapcptide. While substitutions of most residues in 
RKLLW- NH 2 for alanine had no striking effects on the activity 
of the resulting pentapcptide amide, the substitution of leucine 
in position three strongly decreased the activity of this 
particular inhibitor of cathepsin L (Fig. 3B). Substitutions of 
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Fig. 3. Influences of N-tcrminal and C-terminal modifications of 
RKLLW NH 2 evaluated by testing Ac-RKLLW-NH 2 and RKLLW- 
COOH for inhibition of human cathepsin L (A) and Ala-scan of all 
sequence positions of RKLLW- NH 2 to evaluate the contribution of each 
individual amino acid to inhibition (B). All pentapeptide amides were 
tested at a concentration of 0.9 jxM. 

one L-amino acid for the respective D-amino-acid residue were 
prepared as were all D-pcptides to investigate the stereospeci- 
ficity of the peptide-enzyme interaction. For all D or individual 
D substitutions, at a concentration of 0.9 jxm (data not shown), 
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Fig. 4. Substitution of both leucine residues of RKLLW- NH 2 for (A) isoleucine, (B) p leucine (0L), (C) homo-f* 
p-isoleucine (hPI> All peptides were tested at a concentration of 0.9 u.M. 
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Fie 5 Investigations of RKLLW- NH 2 -dcri vat ives with substitutions in 
position 5 to further evaluate the contribution of Trp5 to the inhibitory 
activity of the model peptide. AH peptides were tested at a concentration 
of 0.9 m-M- 

no inhibitory activity was observed. Reversing the N-terminal 
amino acids RK to KR had only a marginal influence on the 
inhibitory effect of the peptide. 

The striking impact of Leu3 in RKLLW-NH, found in the 
alanine scan, was further confirmed by simultaneous and 
successive substitutions of leucine residues for isoleucme 
(Fig 4A) and p-leucine (Fig. 4B). In both cases, a pronounced 
decrease in the biological activity of the peptide was observed. 
Whereas RKL1W-NH 2 and RKL£LW-NH 2 showed considerable 
inhibitory effects at a concentration of 0.9 p-M, RKIIW-NH 2 
and RWLW NH 2 as well as RK P L P LW-NH 2 and RK P LLVV- 
NH* had no influence at all on enzyme activity. Homo-p- 
leucine was also substituted for each individually, as well as 
both simultaneously in the case of the leucine residue in 
Rl^L^NH, Neither RKLh P IW-NH 2 , RKh P lh P IW-NH 2 nor 
RKLBhIW-NH 2 showed significant inhibitory effects at 0.9 u.M 
(Fie 4C) Similarly, the introduction of homo-p-isoleucine 
completely abolished the inhibitory effects of ™dified 
pentapeptide amides at a concentration of 0.9 u.M (Fig. 4D). 
Replacement of Trp5 in RKLLW-NH 2 by phenylalanine or 
tyrosine or by glutamine considerably reduced the inhibitory 
activity (Fig. 5). 



DISCUSSION 

A combinatorial pentapeptide amide collection, 0/X 4 -NH 2 , was 
used to identify the bulky hydrophobic amino acid leucine, the 
aromatic amino acids phenylalanine, homophenylalamne, 
tyrosine and tryptophan, and also the basic amino acids 
areinine and lysine as potential building blocks for the design 
of inhibitors against human cathepsin L. The discovery of 
leucine and the aromatic amino acids is in good agreement with 
the substrate specificity of cathepsin L. Primary cleavage sites 
in physiologically relevant protein substrates, such as glucagon 
and the insulin P chain, are largely determined by ' bulky 
hydrophobic and aromatic amino acids in the S2 site [MUX). 
Potent reversible peptide inhibitors leupeptm (Ac-Leu-Leu- 
Arg-CHO), peptide aldehyde semicarbazones and peptidyl- 
methylketones [5] contain leucine and aromatic amino adds 

Calculations of the electrostatic potential distribution on the 
enzyme surface revealed a strongly negative potential lobe 
surrounding the whole substrate binding site of human 
cathepsin L. The negative potential lobe and the negatively 
charged enzyme surface in binding distance around the 
substrate binding cleft, immediately explained the overall 
favorable effect of positive charges on the inhibitory properUes 



© FEBS 2000 x ' 

of peptide inhibitors. This concerns K/X4-NH 2 or R/X 4 -NH 2 
collections with defined basic residues as well as defined 
peptides with multiple positive charges in side chains and 
unprotected N-termini, such as RKLLW-NH 2 or RKLWL-NH 2 . 
Negative charges on peptides in the side chains or at the 
C-terminus should accordingly have strongly adverse overall 
effects on inhibition activities. 

In combinatorial 0/XrNH 2 collections, hydrophobic, aro- 
matic and positively charged side chains of defined (O) amino 
acids appeared to support the inhibition activities independent 
of their relative sequence position (Table 1). However, leucme 
and, to a lesser extend, phenylalanine at position 3 where basic 
ammo-acid residues did not show considerable inhibitory 
activity was extraordinarily effective. The obvious importance 
of the third sequence position was further corroborated by 
studies involving modifications of the model inhibitor RKLLW- 
NH 2 Nevertheless, the general tendency indicated a hetero- 
genous binding of the 0/X 4 -NH 2 collections relative to the 
enzyme enabeled by shifted docking of 0/X„-NH 2 collections. 
Therefore, it was not promising to deduce the optimal relative 
sequence of the three types of building blocks merely from in- 
most striking results from the inhibition assays. Even though, 
by arbitrary combination of the favorable bulky hydrophobic, 
aromatic and basic amino acids, we obtained several individual 
pentapeptide amides with inhibitory effects against human 
cathepsin L at nanomolar concentrations. Thus the new com- 
pounds (Table 3) rank among the best noncovalently binding 
low-molecular mass inhibitors of human cathepsin L [5,22]. 
The most effective inhibitors do not share a common sequence 
motif. Peptides like RKLLW-NH 2 and RKLWL-NH 2 accumu- 
late three positive charges in their N-terminal regions, whereas 
peptides like LLLTR-NH 2 or LFLTR-NH 2 contain a basic 
residue at the C-terminal position and hydrophobic residues in 
the N-terminal part. These peptides obviously represent 
different lead structures, which is in good agreement with the 
broad substrate specificity of cathepsin L [5]. Leucine can be 
substituted for phenylalanine without substantially altering the 
inhibitory activity of the peptide. Peptides LFLTR-NH 2 and 
LLLTR-NH 2 reflect observations made with the reverse 
oriented inhibitory prosegment. In the mutated form Leu78p 
could replace wild-type Phe78p and still fit into the S2 pocket 

11 One particularly potent inhibitor, RKLLW- NH 2 , compris 
positively charged and neutral amino acids in a manner similar 
to the reverse binding nonamer prosegment sequence 
VMNGLQNRK (residues 74-82) which is located in the 
substrate binding site of mutated human procathepsin L [17]. 
RKLLW- NH 2 shares the amphiphilic character of the nonamer 
fragment as well as a bulky hydrophobic leucine anchor for 
insertion into the S2 pocket. The three positive charges of 
RKLLW- NH 2 interact strongly with the negative surface 
potential around the binding cleft of human cathepsin L and 
particularly with the rather undefined S3/S4 binding region. 
Interestingly, the prosegment contains an extraordmary accumu- 
lation of five positive charges in residues 81-87 (RKPRKGK) 
which confirm the importance of basic building blocks for 
cathepsin L inhibitors in sites other than SI' [36]. The 
prosegment-based model is in agreement with our experimental 
data confirming the prominent leucine anchor at the S2-binding 
pocket opposed to Leu4 that points toward the solvent. 

Interestingly, fragments taken from conserved sequences of 
cystatins, macromolecular inhibitors of cathepsin L and other 
papain-like cysteine proteases, have been tested in the course of 
rational, structure-based approaches for their capacity to act as 
enzyme inhibitors [21,22]. However, these isolated fragments 
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adopted a binding mode completely different to the homo- 
logous sequences in the macromolecular prosegment [22]. This 
has been shown for cathepsin B with overlapping pentadeca- 
peptides that inhibit the enzyme in the micromolar range [37] 
und for irreversible inhibitors of human cathepsin B containing 
fragments of the cathepsin B prosegment 123]. Synergic to 
structure-based inhibitor design combinatorial compound 
collections applied in this study offer a rapid method for 
selection of building blocks for efficient reversible inibitors of 
human cathepsin L as shown for the lead structure RKLLW- 
NH 2 Results from screening 0/X 4 -NH 2 collections, show that 
from a total of 1 30-amino-acid residue- sequence position 
combinations nine amino acids were positively ranked for 
one or several sequence positions. The fuzzy positioning was 
interpreted as shifted docking of amino acids O in 0/X 4 -NH 2 
collections and was taken into consideration by the unvalued 
selection of the building blocks for novel inhibitors of human 
cathepsin L. 
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